Ensemble Methods for MiRNA Target Prediction from Expression Data
نویسندگان
چکیده
BACKGROUND microRNAs (miRNAs) are short regulatory RNAs that are involved in several diseases, including cancers. Identifying miRNA functions is very important in understanding disease mechanisms and determining the efficacy of drugs. An increasing number of computational methods have been developed to explore miRNA functions by inferring the miRNA-mRNA regulatory relationships from data. Each of the methods is developed based on some assumptions and constraints, for instance, assuming linear relationships between variables. For such reasons, computational methods are often subject to the problem of inconsistent performance across different datasets. On the other hand, ensemble methods integrate the results from individual methods and have been proved to outperform each of their individual component methods in theory. RESULTS In this paper, we investigate the performance of some ensemble methods over the commonly used miRNA target prediction methods. We apply eight different popular miRNA target prediction methods to three cancer datasets, and compare their performance with the ensemble methods which integrate the results from each combination of the individual methods. The validation results using experimentally confirmed databases show that the results of the ensemble methods complement those obtained by the individual methods and the ensemble methods perform better than the individual methods across different datasets. The ensemble method, Pearson+IDA+Lasso, which combines methods in different approaches, including a correlation method, a causal inference method, and a regression method, is the best performed ensemble method in this study. Further analysis of the results of this ensemble method shows that the ensemble method can obtain more targets which could not be found by any of the single methods, and the discovered targets are more statistically significant and functionally enriched. The source codes, datasets, miRNA target predictions by all methods, and the ground truth for validation are available in the Supplementary materials.
منابع مشابه
Systematic enrichment analysis of microRNA expression profiling studies in endometriosis
Objective(s): The purpose of this study was to conduct a meta-analysis on human microRNAs (miRNAs) expression data of endometriosis tissue profiles versus those of normal controls and to identify novel putative diagnostic markers. Materials andMethods: PubMed, Embase, Web of Science, Ovid Medline were used to search for endometriosis miRNA expression profiling studies of endometriosis. The miRN...
متن کاملPathway Analysis of miRNA-1 and Its Expres-sion Evaluation in Donor’s Serum from HIV-Positive Individuals vs Unaffected Controls
Background MicroRNAs (miRNAs) are non-coding RNA molecules (19-24 nucleotides) that play a major role in a wide range of biological processes through post-transcriptional regulation of gene expression. Differential expression of miRNAs has been reported in various infectious diseases such as HIV infection. The characterization of miRNA expression profiles, especially in mammalian biofluids, whi...
متن کاملLAT-derived microRNAs in HSV-1 target SMAD3 and SMAD4 in TGF-β/Smad signaling pathway
Background: During its latent infection, HSV-1 produces only a miRNA precursor called LAT, which encodes six distinct miRNAs. Recent studies have suggested that some of these miRNAs could target cellular mRNAs. One of the key cell signaling pathways that can be affected by HSV-1 is the TGF-β/Smad pathway. Herein, we investigated the potential role of the LAT as well as three LAT-derived miRNAs ...
متن کاملAn Enterovirus-Like RNA Construct for Colon Cancer Suicide Gene Therapy
Background: In gene therapy, the use of RNA molecules as therapeutic agents has shown advantages over plasmid DNA, including higher levels of safety. However, transient nature of RNA has been a major obstacle in application of RNA in gene therapy. Methods: Here, we used the internal ribosomal entry site of encephalomyocarditis virus and the 3’ non-translated region of Poliovirus to design an en...
متن کاملO-3: Drug Repositioning by Merging Gene Expression Data Analysis and Cheminformatics Target Prediction Approaches
The transcriptional responses of drug treatments combined with a protein target prediction algorithm was utilised to associate compounds to biological genomic space. This enabled us to predict efficacy of compounds in cMap and LINCS against 181 databases of diseases extracted from GEO. 18/30 of top drugs predicted for leukemia (e.g. Leflunomide and Etoposide) and breast cancer (e.g. Tamoxifen a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 10 شماره
صفحات -
تاریخ انتشار 2015